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ABSTRACT 

A lightweight acceleroaeter has been used to produce 
a wavefora related to the glottal acoustic output when attached to 
the throat of a speaker, and to provide an indication of acoustic 
coupling to the nasal cavities when attached to the external surface 
of the nose, Exaaples of signals produced by the acceleroaeter are 
shown, and possible application in speech training are discussed, 
f Author) 
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The Use of a Miniature Accelerometor for Detecting 
Glottal Waveforms and Nasality'^ 

K. N. Stevens, D. N. Kalikow 
and T. R. Willemain 



Abstract 

A lightv;eight accolerop.eter has been used to 
produce a waveform related to the glottal acoustic 
output when attached to the throat of a speaker, 
and to provide an indication of acountic coupling to 
the nasal cavities when attached to the external 
surface of the nose. Examples of signals produced by 
the accelerometer are shov/n, and possible applications 
in speech training are discussed. 
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Thifj note describes the use of a lightweight trannducor to 
produce a waveform reprcr.entativo of the glottal acouctic output 
when attached to the throat of a speaker, and to provide a meaoure 
of the degree of acoustic coupling to the nasal cavities through 
the velopharyngeal opening when attached to the external surface 
of the nose. 

The transducer is a miniature accelerometer (Bolt Beranek 
and Nev/man Model 501) . It is housed in a cylindrical casing of 
height 0.330 inch and diameter 0,312 inch, and its total v/eight 
is 1.8 grams. The casing contains both a pi«.v7 3.-;lectric transducer 
and a preamplifier cir::uit. Figure 1 shov;s tho accelerometer 
attached to the throat and to the nose, and il..«3trates the small 
size of the device. Nominal sensitivity of th unit is 8 mv/g . 
The frequoncy response is flat to within 5% over the range 8 to 
20,000 Hz. The acceleroneter can withstand shocks of 10,000 g's 
without damage. The preamplifier requires 1 ma. of current at 
9 v.d.c. These features of light v.'oight, good sensitivity, wide 
bandv;idth, ruggcdncss and low power consumption make the device 
suitable in applications where speech-related signals need to be 
extracted and displayed, such as speech-reception aids for the 
deaf, speech training of the deaf or of cleft-palate patients, 
training of pronunciation of a second language, or even speech 
pick-up in high-intensity noise environments, since the accolcr- 
ometer is relatively insensitive to airborne sound. 

The accelerometer may be easily attached to the throat or 
nose of a speaker by means of double-sided adhesive tape. Once 
In place, the accelerometer is barely noticeable; subjects can 
wear it for hours without discomfort. 
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Pig* !• Showing the accelerometer in position for detection 
of pitch and nasality. 
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Glottal V7avoform 

« 

One application of a throat-attached accelorometer is to 
obtain a waveform from which glottal periodicity or f undamontal 
frequency can be conveniently extracted. Derivation of fundamental 
frequency is more complex with the signal derived from a microphone 
mounted near the mouth, since this signal has a richer harmonic 
structure and may include interference from background noise. 
While real-time pitch extraction can be achieved from voice signals 
alone, a procedure based on throat-derived signals is simpler and 
more reliable. The wavefonn from the acccleromcter is also po- 
tentially useful in basic studies of speech production and, perhaps 
in diagnosis of abnormalities of the larynx or in the subglottal 
air\-;ays (since an acoustic resonance of the subglottal system can 
be observed in the wavefrrra, as noted belov/) . Other kinds of 
throat-attached sensors have been used for the purpose of obtaining 
a harmonic-free glottal waveform, but in general these liave been 
less convenient and more bulky than the accelerometor described 
here (Sugimoto & Hiki, 1960; Porter, 1963; Tjernlund, 1964; 
Fourcin & Abberton, 1971) . 

Figure 2 shows the simultaneous throat accelerometor and 
microphone vraveforms of a male speaker uttering several vowels. 
The voice signal was recorded using an Electro-Voice RE 15 
microphone at a distance of 12 inches from the speaker. The 
accelerometor provides a waveform with relatively little harmonic 
content and fairly uniform amplitude. For these utterances, the 
acccleromcter was mounted in the mid-sagittal plane midway 
between the thyroid cartilage and the sternal notch.* The time 
lag between the accelerometor waveform and the output of the voice 
microphone is about 1.5 msec, approximately the time needed for 
sound to propagcitc from the glottis to the microphone. The 
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Fig. 2. Accelerometer output (upper trace) and sound-pressure 
waveform (lower trace) for a male speaker producing 
various sustained vowels. Fundamental frequency is 
118 Hz. 
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ripple visible on the accoloro:notor output har. a frequency of 
about 500 Hz, irrcnpGctivc of vowel; its amplitude depends 
somewhat on the vowel. ObEerv.itions of wavcforna recorded from 
other speakers conf.lrir* the prenencc of n similar ripple whoco 
frequency is vowel- independent but speaker-dependent, the fre- 
quency of the ripple for females being higlicr than that for males. 
This ripple is presumably the result of acoustic excitation of 
the lov;oGt natural frequency of the subglottal system, which is 
known to bo in the vicinity of 500 Hz (Fant, et al., 1972)2. 
is apparent from Fig, 2 that the accolerometer gives a peaked 
waveform that can be used with simple signal processing techniques 
to provide an indication of glottal periodicity. 

Experiments were pcrform.ed to determine the amplitude of the 
accelerometor output and the amount of harmonic content as a 
function of the mounting position on the throat. The results 
indicate that a point midway bctv;ecn the thyroid cartilage and 
the storn^il notch and in the mid-saggital plane gives v;avoformr. 
with maximal amplitude and minimal harmonic content. VJith the 
accelerometor in this location, there is little variation in 
waveform from one vov;el to another. The exact positioning of the 
accelerometor in the vicinity of this point is not critical. 

The throat-attached accelcrom.cter , together with frequency- 
measuring and interfacing circuitry, is currently being used in a 
computer-based system of speech-training aids for deaf children 
(Nickorson & Stevens, 1973) , and in a similar system for the 
training of pronunciation of a second language (Kalikow & Swots, 
1972). The pitch extractor operates reliably over a range of 
subjects including adult males, adult females, normal -spealcing 
children and deaf children. 
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Wanality Dctoction 

When, during voiced r.poech (i.e., during portions of utterances 
produced with the vocal cords vibrating) , the speaker lowers his 
velum, acoustic energy onannting from the glottis is modified by 
both the mouth and nasal cavities. The effects of the nasal cav- 
ities on the characteristics of the acoustic speech wave as 
transduced by a microphone are subtle, whereas the signal at the 
noGC provides a more direct indication of a lowered velum. The 
acoustic energy propagated through the velar opening into the 
nasal cavities is signaled unequivocally by an increase in the 
vibration of the skin of the nostril. 

To provide an instantantous indication of this correlate of 
vclopliaryngcal opening, the output of the accelcromotcr attached 
to the noso"^ is filtered, rectified, and smoothed v;ith an averaging 
time of about 20 msec. V7ith the aid of a computer, this smoothed 
signal is sampled at 10-mscc intervals, log-converted, and displayed 
on an oscilloscope. 

The degree to which a display of this type can indicate 
differences between nasal and non-nasal speech sounds is illustrated 
in Fig. 3. This figure shows the display of the nasal accelerometor 
output for three phrases produced by a normal adult male speaker: 
one phrase ( his father ) contains no nasal consonants, another ( hi s 
inothor ) contains one nasal, and a third (my money ) contains several 
nasal consonants. Quantitative measurements have shown that tho 
output for nasal sounds is 10-20 dB above that for non-nasal 
vowels (Stevens, et al. , 197-1). Tho difference is greatest for 
non-nasal, non-high vowels, and is as small as 10 dB only for the 
non- nasal vowel /i/. The curves clearly show t2io regions of the 
speech in which there is a velopharyngeal opening and demonstrate 
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Fjlg. 3. Computer-derived and -displayed "nasality function" 
for throe phrases produced by a normal male speaker: 
(a) "his father/' (b) "his mother," and (c) "my money." 
See text. 



8 

ERIC 



Report No. 2907 



Bolt Bovanek and Newman Inc. 



BEST COrV AV/;il/;Sl'£ 

the wcll-]:nown f.-ict that tho naiializntion for a nnsal consonant 
extends into vov,'c1l; adjacent to the connonant. 

A display of the typo sho;s"n in Fig. 3 can be used, therefore, 
to indicate presence or a);Konce of nasality in npeech sounds, 
vhore by nasality v;o mean the presence of a velopharyngeal opening 
sufficient to cause substantial acoustic coupling betv:eon the 
oropharyngeal and nasal cavities. Tlie question of whether a 
measure obtained from this display correlates \7ith subjective 
judgments of nasality^ is, of course, one that cannot easily be 
resolved, in view of the long history of difficulties in relating 
physical measures to nasality judgments. (See, for example, 
Shelton, ct al., 1967; Lubhor and .Moll, 1965; House and ntevens, 
1956; Colton and Cooker, 1968.) 

Figure 4 shows the nasality display for a deaf child with 
r-overe problems in controlling the velum; for comparison, the 
display is also shown for a norj.ial speaker producing the sane 
sentence. The sentence for the deaf child is longer than that 
for the normal speal:er. Some vov/els are too nasal (the vowels - 
in you and your ) , and some nasal consonants are replaced by 
stops (the nasal consonants in drin): and milk) . 

This type of nasality display has been incorporated in the 
above-mentioned system of speech-training aids for deaf children. 
Provision- has been made for sotting a nasality "threshold" on 
the display under the control of a teacher, to provide some kind 
of "target" for the student to aim for. The student attempts to 
maintain the contour below this threshold during a non-nasal 
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(a) 



(b) 



•V 




Fig. 4. Nasality display for the sentence, "You can drink your 
' milk," produced by: (a) a deaf child with problems of 

velar control, and (b) a normal adult male speaker. 

The five syllables in the sentence are indicated above 

each version. See text. 
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utterance or during the non-nanal portion of an utterance. The 
system alr.o pemitn the teacher's nasality trace to be stored on 
one part of tlio dii^play, to allov; •<'isual comparison of the 
utterances of student and teacher. The 'i splay is proving to be 
successful in indicating to a deaf student when his velopharyngeal 
port is closed. Training procedures are being devised and 
expcrincnts arc in progress to detornino hov; a display of the 
type shown in Figs. 3 and 4 can bo most effectively used for 
speech training v;ith deaf children v;ho have inadequate control of 
the vclun but v.'ho presuniab3y have an intact neuromuscular system 
for manipulating the velum. 
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NOTES 

1« This work v;as supported by the U. S. Office of Education, 
Media Services and Captioned Films Rranch of the Bureau of 
Education for the Handicapped, under Contract Ko. OEC-0-71- 
4670(615), and the Advanced Research Projects Agency under 
AFOSR Contract No. F44620-71-C-0065. Progrcmminrj and other 
advice and assistance were provided by D.Dodds, S .Nickerson, 
and Ann Rollins. 

2. A more detailed interpretation of the characteristics of 
the waveform of the output of a '^^ acceleromoter placed on the 
throat has been given by Henke (ID 74) . 

3. A similar transducer arrangement fo:: detecting acoustic 
coupling to the nar.al cavities has been used by Holbrook 
and Crawford (19 70) . Various other methods of detecting 
degree of palatopharyngeal competence include measureiriant 
of the flow of air tlirough the nose (Lubker and Moll, 19 G5; 
Quiglcy, et al . , 196^<) or the acoustic energy radiated from 
the nostrils (Fletcher, 1970; Shclton, ot al, 1967). 

4. This question is discussed in more detail elsewhere (Stevens, 
et al, 1974). That paper also shows data on measures derived 
from the accelerometer output for groups of normally-hearing 
adults and children, and for a group of deaf children. 

« 
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